A Parallel-Vector Algorithm for Rapid Structural Analysis on High-Performance Computers
نویسندگان
چکیده
A fast, accurate Choleski method for the solution of symmetric systems of linear equations is presented. This direct method is based on a variable-band storage scheme and takes advantage of column heights to reduce the number of operations in the Choleski factorization. The method employs parallel computation in the outermost DO-loop and vector computation via the "loop unrolling" technique in the innermost DO-loop. The method avoids computations with zeros outside the column heights, and as an option, zeros inside the band. The close relationship between Choleski and Gauss elimination methods is examined. The minor changes required to convert the Choleski code to a Gauss code to solve non-positive-definite symmetric systems of equations are identified. The results for two large-scale structural analyses performed on supercomputers, demonstrate the accuracy and speed of the method.
منابع مشابه
Parallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers
This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...
متن کاملLinear Static Structural and Vibration Analysis on High-Performance Computers
Parallel computers offer the opportunity to significantly reduce the computation time necessary to analyze large-scale aerospace structures. This paper presents algorithms developed for and implemented on a massively-parallel computers hereafter referred to as Scalable High Performance Computers (SHPC) for the most computationally intensive tasks involved in structural analysis, namely, generat...
متن کاملHigh performance computing for wavelet and wavelet packet image coding
The use of high performance computers for wavelet and wavelet packet based image coding is discussed. After a short description of wavelet and wavelet packet methods the existing literature concerning vector, parallel and VLSI wavelet transforms is reviewed. In the following an algorithm for wavelet packet best basis selection on moderate parallel MIMD architectures is introduced and an impleme...
متن کاملNew Fast Algorithms for First-Order Linear Recurrences on Vector Computers
We examine the performance of parallel algorithms for rst-order linear recurrence on vector computers, evaluate them quantitatively on a simple model of vector computers, and propose new fast algorithms. We also show a result of performance benchmarking of them on actual vector computers.
متن کاملA High-Performance FFT Algorithm for Vector Supercomputers
Many traditional algorithms for computing the fast Fourier transform (FFT) on conventional computers are unacceptable for advanced vector and parallel computers because they involve nonunit, power-of-two memory strides. This paper presents a practical technique for computing the fast Fourier transform that completely avoids all such strides and appears to be near-optimal for a variety of curren...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1990